Dataset statistics
| Number of variables | 24 |
|---|---|
| Number of observations | 100000 |
| Missing cells | 36306 |
| Missing cells (%) | 1.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 23.8 MiB |
| Average record size in memory | 249.9 B |
Variable types
| DateTime | 3 |
|---|---|
| Numeric | 17 |
| Categorical | 1 |
| Boolean | 3 |
posa_continent is highly overall correlated with site_name | High correlation |
site_name is highly overall correlated with posa_continent | High correlation |
srch_destination_id is highly overall correlated with srch_destination_type_id | High correlation |
srch_destination_type_id is highly overall correlated with srch_destination_id | High correlation |
is_booking is highly imbalanced (60.5%) | Imbalance |
orig_destination_distance has 36068 (36.1%) missing values | Missing |
user_location_region has 1329 (1.3%) zeros | Zeros |
channel has 12462 (12.5%) zeros | Zeros |
srch_children_cnt has 78966 (79.0%) zeros | Zeros |
hotel_continent has 1836 (1.8%) zeros | Zeros |
Reproduction
| Analysis started | 2024-03-21 07:08:49.599326 |
|---|---|
| Analysis finished | 2024-03-21 07:12:17.408294 |
| Duration | 3 minutes and 27.81 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
date_time
Date
| Distinct | 99883 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Minimum | 2013-01-07 00:00:28 |
|---|---|
| Maximum | 2014-12-31 23:51:39 |
site_name
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 42 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.85013 |
| Minimum | 2 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 878.9 KiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 2 |
| Q3 | 15 |
| 95-th percentile | 37 |
| Maximum | 53 |
| Range | 51 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 12.000909 |
|---|---|
| Coefficient of variation (CV) | 1.2183503 |
| Kurtosis | 0.054600408 |
| Mean | 9.85013 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.2457674 |
| Sum | 985013 |
| Variance | 144.02181 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 62940 | |
| 11 | 6951 | 7.0% |
| 24 | 6343 | 6.3% |
| 37 | 5358 | 5.4% |
| 34 | 4802 | 4.8% |
| 8 | 2494 | 2.5% |
| 23 | 1934 | 1.9% |
| 13 | 1809 | 1.8% |
| 17 | 1038 | 1.0% |
| 18 | 700 | 0.7% |
| Other values (32) | 5631 | 5.6% |
| Value | Count | Frequency (%) |
| 2 | 62940 | |
| 6 | 42 | < 0.1% |
| 7 | 104 | 0.1% |
| 8 | 2494 | 2.5% |
| 9 | 97 | 0.1% |
| 10 | 261 | 0.3% |
| 11 | 6951 | 7.0% |
| 13 | 1809 | 1.8% |
| 14 | 146 | 0.1% |
| 15 | 232 | 0.2% |
| Value | Count | Frequency (%) |
| 53 | 28 | < 0.1% |
| 48 | 47 | < 0.1% |
| 47 | 3 | < 0.1% |
| 46 | 122 | |
| 45 | 7 | < 0.1% |
| 44 | 6 | < 0.1% |
| 43 | 20 | < 0.1% |
| 41 | 3 | < 0.1% |
| 40 | 125 | |
| 38 | 31 | < 0.1% |
posa_continent
Categorical
HIGH CORRELATION 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 6.3 MiB |
| 3 | |
|---|---|
| 1 | |
| 2 | |
| 4 | 3081 |
| 0 | 750 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 100000 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 3 |
| 3rd row | 3 |
| 4th row | 3 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 3 | 74843 | |
| 1 | 11905 | 11.9% |
| 2 | 9421 | 9.4% |
| 4 | 3081 | 3.1% |
| 0 | 750 | 0.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3 | 74843 | |
| 1 | 11905 | 11.9% |
| 2 | 9421 | 9.4% |
| 4 | 3081 | 3.1% |
| 0 | 750 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 74843 | |
| 1 | 11905 | 11.9% |
| 2 | 9421 | 9.4% |
| 4 | 3081 | 3.1% |
| 0 | 750 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 100000 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 74843 | |
| 1 | 11905 | 11.9% |
| 2 | 9421 | 9.4% |
| 4 | 3081 | 3.1% |
| 0 | 750 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 100000 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 74843 | |
| 1 | 11905 | 11.9% |
| 2 | 9421 | 9.4% |
| 4 | 3081 | 3.1% |
| 0 | 750 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 100000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 74843 | |
| 1 | 11905 | 11.9% |
| 2 | 9421 | 9.4% |
| 4 | 3081 | 3.1% |
| 0 | 750 | 0.8% |
user_location_country
Real number (ℝ)
| Distinct | 195 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 86.28119 |
| Minimum | 0 |
|---|---|
| Maximum | 239 |
| Zeros | 458 |
| Zeros (%) | 0.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 878.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 66 |
| median | 66 |
| Q3 | 71 |
| 95-th percentile | 205 |
| Maximum | 239 |
| Range | 239 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 59.394285 |
|---|---|
| Coefficient of variation (CV) | 0.68838045 |
| Kurtosis | 0.27359659 |
| Mean | 86.28119 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.1249602 |
| Sum | 8628119 |
| Variance | 3527.681 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 66 | 53775 | |
| 205 | 11211 | 11.2% |
| 3 | 5917 | 5.9% |
| 69 | 5181 | 5.2% |
| 77 | 2506 | 2.5% |
| 1 | 2038 | 2.0% |
| 46 | 1904 | 1.9% |
| 215 | 1335 | 1.3% |
| 133 | 1120 | 1.1% |
| 23 | 825 | 0.8% |
| Other values (185) | 14188 | 14.2% |
| Value | Count | Frequency (%) |
| 0 | 458 | 0.5% |
| 1 | 2038 | 2.0% |
| 3 | 5917 | |
| 4 | 14 | < 0.1% |
| 5 | 191 | 0.2% |
| 6 | 32 | < 0.1% |
| 8 | 9 | < 0.1% |
| 10 | 29 | < 0.1% |
| 11 | 15 | < 0.1% |
| 12 | 308 | 0.3% |
| Value | Count | Frequency (%) |
| 239 | 18 | < 0.1% |
| 238 | 9 | < 0.1% |
| 237 | 5 | < 0.1% |
| 235 | 170 | 0.2% |
| 234 | 9 | < 0.1% |
| 233 | 2 | < 0.1% |
| 231 | 494 | |
| 230 | 155 | 0.2% |
| 229 | 113 | 0.1% |
| 228 | 23 | < 0.1% |
user_location_region
Real number (ℝ)
ZEROS 
| Distinct | 768 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 307.87382 |
| Minimum | 0 |
|---|---|
| Maximum | 1027 |
| Zeros | 1329 |
| Zeros (%) | 1.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 976.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 38 |
| Q1 | 174 |
| median | 312 |
| Q3 | 385 |
| 95-th percentile | 790 |
| Maximum | 1027 |
| Range | 1027 |
| Interquartile range (IQR) | 211 |
Descriptive statistics
| Standard deviation | 208.85329 |
|---|---|
| Coefficient of variation (CV) | 0.67837302 |
| Kurtosis | 1.5442031 |
| Mean | 307.87382 |
| Median Absolute Deviation (MAD) | 136 |
| Skewness | 1.151004 |
| Sum | 30787382 |
| Variance | 43619.698 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 174 | 10915 | 10.9% |
| 348 | 4850 | 4.9% |
| 354 | 4483 | 4.5% |
| 442 | 3752 | 3.8% |
| 220 | 3626 | 3.6% |
| 50 | 2830 | 2.8% |
| 462 | 2725 | 2.7% |
| 155 | 2361 | 2.4% |
| 135 | 2255 | 2.3% |
| 258 | 2091 | 2.1% |
| Other values (758) | 60112 |
| Value | Count | Frequency (%) |
| 0 | 1329 | |
| 1 | 3 | < 0.1% |
| 2 | 4 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 9 | < 0.1% |
| 5 | 14 | < 0.1% |
| 6 | 15 | < 0.1% |
| 7 | 15 | < 0.1% |
| 8 | 3 | < 0.1% |
| 9 | 105 | 0.1% |
| Value | Count | Frequency (%) |
| 1027 | 2 | < 0.1% |
| 1023 | 1 | < 0.1% |
| 1021 | 2 | < 0.1% |
| 1017 | 32 | < 0.1% |
| 1016 | 23 | < 0.1% |
| 1014 | 1 | < 0.1% |
| 1013 | 3 | < 0.1% |
| 1011 | 81 | |
| 1010 | 47 | |
| 1008 | 2 | < 0.1% |
user_location_city
Real number (ℝ)
| Distinct | 10840 |
|---|---|
| Distinct (%) | 10.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27628.528 |
| Minimum | 0 |
|---|---|
| Maximum | 56507 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 976.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2140 |
| Q1 | 12825 |
| median | 27462 |
| Q3 | 42328 |
| 95-th percentile | 53274 |
| Maximum | 56507 |
| Range | 56507 |
| Interquartile range (IQR) | 29503 |
Descriptive statistics
| Standard deviation | 16752.858 |
|---|---|
| Coefficient of variation (CV) | 0.60636085 |
| Kurtosis | -1.2564649 |
| Mean | 27628.528 |
| Median Absolute Deviation (MAD) | 14866 |
| Skewness | 0.0082890499 |
| Sum | 2.7628528 × 109 |
| Variance | 2.8065825 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5703 | 2083 | 2.1% |
| 48862 | 1585 | 1.6% |
| 25315 | 1116 | 1.1% |
| 24103 | 1097 | 1.1% |
| 36086 | 950 | 0.9% |
| 2086 | 746 | 0.7% |
| 14703 | 745 | 0.7% |
| 35390 | 685 | 0.7% |
| 41949 | 669 | 0.7% |
| 4687 | 645 | 0.6% |
| Other values (10830) | 89679 |
| Value | Count | Frequency (%) |
| 0 | 3 | < 0.1% |
| 1 | 1 | < 0.1% |
| 3 | 23 | |
| 14 | 6 | < 0.1% |
| 18 | 1 | < 0.1% |
| 21 | 3 | < 0.1% |
| 25 | 1 | < 0.1% |
| 32 | 9 | < 0.1% |
| 40 | 11 | |
| 45 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 56507 | 6 | |
| 56506 | 1 | < 0.1% |
| 56498 | 1 | < 0.1% |
| 56497 | 6 | |
| 56495 | 1 | < 0.1% |
| 56494 | 1 | < 0.1% |
| 56492 | 8 | |
| 56491 | 1 | < 0.1% |
| 56488 | 1 | < 0.1% |
| 56480 | 1 | < 0.1% |
orig_destination_distance
Real number (ℝ)
MISSING 
| Distinct | 62218 |
|---|---|
| Distinct (%) | 97.3% |
| Missing | 36068 |
| Missing (%) | 36.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1965.4872 |
| Minimum | 0.0055999998 |
|---|---|
| Maximum | 11766.433 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 0.0055999998 |
|---|---|
| 5-th percentile | 43.678089 |
| Q1 | 313.18181 |
| median | 1131.7756 |
| Q3 | 2543.5016 |
| 95-th percentile | 6869.7212 |
| Maximum | 11766.433 |
| Range | 11766.427 |
| Interquartile range (IQR) | 2230.3198 |
Descriptive statistics
| Standard deviation | 2233.1101 |
|---|---|
| Coefficient of variation (CV) | 1.1361611 |
| Kurtosis | 2.155371 |
| Mean | 1965.4872 |
| Median Absolute Deviation (MAD) | 940.74377 |
| Skewness | 1.6088099 |
| Sum | 1.2565753 × 108 |
| Variance | 4986781 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1036.196411 | 9 | < 0.1% |
| 1953.148071 | 7 | < 0.1% |
| 226.7472992 | 7 | < 0.1% |
| 227.2722931 | 6 | < 0.1% |
| 0.0055999998 | 6 | < 0.1% |
| 43.26950073 | 6 | < 0.1% |
| 65.15660095 | 5 | < 0.1% |
| 1036.575928 | 5 | < 0.1% |
| 227.392807 | 5 | < 0.1% |
| 2235.612305 | 5 | < 0.1% |
| Other values (62208) | 63871 | |
| (Missing) | 36068 |
| Value | Count | Frequency (%) |
| 0.0055999998 | 6 | |
| 0.01710000075 | 1 | < 0.1% |
| 0.02280000038 | 1 | < 0.1% |
| 0.02979999967 | 1 | < 0.1% |
| 0.04030000046 | 1 | < 0.1% |
| 0.04149999842 | 1 | < 0.1% |
| 0.04540000111 | 1 | < 0.1% |
| 0.04659999907 | 1 | < 0.1% |
| 0.05600000173 | 1 | < 0.1% |
| 0.06750000268 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 11766.43262 | 1 | |
| 11669.33691 | 1 | |
| 11662.52832 | 1 | |
| 11646.31934 | 1 | |
| 11635.64844 | 1 | |
| 11634.80664 | 1 | |
| 11632.24707 | 1 | |
| 11631.99805 | 1 | |
| 11631.92383 | 1 | |
| 11631.54395 | 1 |
user_id
Real number (ℝ)
| Distinct | 89008 |
|---|---|
| Distinct (%) | 89.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 605671.12 |
| Minimum | 5 |
|---|---|
| Maximum | 1198761 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.1 MiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 58909.25 |
| Q1 | 299868 |
| median | 606661.5 |
| Q3 | 911002.5 |
| 95-th percentile | 1149283.3 |
| Maximum | 1198761 |
| Range | 1198756 |
| Interquartile range (IQR) | 611134.5 |
Descriptive statistics
| Standard deviation | 350453.24 |
|---|---|
| Coefficient of variation (CV) | 0.57861971 |
| Kurtosis | -1.2189668 |
| Mean | 605671.12 |
| Median Absolute Deviation (MAD) | 305554.5 |
| Skewness | -0.014230223 |
| Sum | 6.0567112 × 1010 |
| Variance | 1.2281748 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 414572 | 6 | < 0.1% |
| 515280 | 6 | < 0.1% |
| 269580 | 6 | < 0.1% |
| 9614 | 5 | < 0.1% |
| 509838 | 5 | < 0.1% |
| 612161 | 5 | < 0.1% |
| 843234 | 5 | < 0.1% |
| 228821 | 5 | < 0.1% |
| 1148446 | 5 | < 0.1% |
| 1135501 | 5 | < 0.1% |
| Other values (88998) | 99947 |
| Value | Count | Frequency (%) |
| 5 | 1 | |
| 34 | 1 | |
| 65 | 1 | |
| 81 | 1 | |
| 99 | 1 | |
| 107 | 1 | |
| 113 | 1 | |
| 115 | 1 | |
| 120 | 1 | |
| 130 | 1 |
| Value | Count | Frequency (%) |
| 1198761 | 1 | |
| 1198751 | 1 | |
| 1198706 | 1 | |
| 1198680 | 1 | |
| 1198665 | 1 | |
| 1198663 | 1 | |
| 1198642 | 1 | |
| 1198640 | 1 | |
| 1198635 | 1 | |
| 1198626 | 1 |
is_mobile
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 878.9 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 86631 | |
| True | 13369 | 13.4% |
is_package
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 878.9 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 75146 | |
| True | 24854 | 24.9% |
channel
Real number (ℝ)
ZEROS 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.87375 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 12462 |
| Zeros (%) | 12.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 878.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 9 |
| Q3 | 9 |
| 95-th percentile | 9 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 7 |
Descriptive statistics
| Standard deviation | 3.7182831 |
|---|---|
| Coefficient of variation (CV) | 0.63303394 |
| Kurtosis | -1.5376652 |
| Mean | 5.87375 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | -0.50086065 |
| Sum | 587375 |
| Variance | 13.825629 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 9 | 55498 | |
| 0 | 12462 | 12.5% |
| 1 | 10100 | 10.1% |
| 2 | 7813 | 7.8% |
| 5 | 6048 | 6.0% |
| 3 | 4609 | 4.6% |
| 4 | 2135 | 2.1% |
| 7 | 856 | 0.9% |
| 8 | 295 | 0.3% |
| 6 | 158 | 0.2% |
| Value | Count | Frequency (%) |
| 0 | 12462 | 12.5% |
| 1 | 10100 | 10.1% |
| 2 | 7813 | 7.8% |
| 3 | 4609 | 4.6% |
| 4 | 2135 | 2.1% |
| 5 | 6048 | 6.0% |
| 6 | 158 | 0.2% |
| 7 | 856 | 0.9% |
| 8 | 295 | 0.3% |
| 9 | 55498 |
| Value | Count | Frequency (%) |
| 10 | 26 | < 0.1% |
| 9 | 55498 | |
| 8 | 295 | 0.3% |
| 7 | 856 | 0.9% |
| 6 | 158 | 0.2% |
| 5 | 6048 | 6.0% |
| 4 | 2135 | 2.1% |
| 3 | 4609 | 4.6% |
| 2 | 7813 | 7.8% |
| 1 | 10100 | 10.1% |
srch_ci
Date
| Distinct | 1066 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 119 |
| Missing (%) | 0.1% |
| Memory size | 1.5 MiB |
| Minimum | 2013-01-07 00:00:00 |
|---|---|
| Maximum | 2016-01-20 00:00:00 |
srch_co
Date
| Distinct | 1069 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 119 |
| Missing (%) | 0.1% |
| Memory size | 1.5 MiB |
| Minimum | 2013-01-08 00:00:00 |
|---|---|
| Maximum | 2016-01-26 00:00:00 |
srch_adults_cnt
Real number (ℝ)
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.02116 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 156 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 878.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.90447799 |
|---|---|
| Coefficient of variation (CV) | 0.4475044 |
| Kurtosis | 9.4779836 |
| Mean | 2.02116 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.3179726 |
| Sum | 202116 |
| Variance | 0.81808044 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 65945 | |
| 1 | 21334 | 21.3% |
| 3 | 5297 | 5.3% |
| 4 | 5268 | 5.3% |
| 6 | 859 | 0.9% |
| 5 | 752 | 0.8% |
| 8 | 216 | 0.2% |
| 0 | 156 | 0.2% |
| 7 | 135 | 0.1% |
| 9 | 38 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 156 | 0.2% |
| 1 | 21334 | 21.3% |
| 2 | 65945 | |
| 3 | 5297 | 5.3% |
| 4 | 5268 | 5.3% |
| 5 | 752 | 0.8% |
| 6 | 859 | 0.9% |
| 7 | 135 | 0.1% |
| 8 | 216 | 0.2% |
| 9 | 38 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 38 | < 0.1% |
| 8 | 216 | 0.2% |
| 7 | 135 | 0.1% |
| 6 | 859 | 0.9% |
| 5 | 752 | 0.8% |
| 4 | 5268 | 5.3% |
| 3 | 5297 | 5.3% |
| 2 | 65945 | |
| 1 | 21334 | 21.3% |
| 0 | 156 | 0.2% |
srch_children_cnt
Real number (ℝ)
ZEROS 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.33202 |
| Minimum | 0 |
|---|---|
| Maximum | 9 |
| Zeros | 78966 |
| Zeros (%) | 79.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 878.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.7297863 |
|---|---|
| Coefficient of variation (CV) | 2.1980191 |
| Kurtosis | 7.8084913 |
| Mean | 0.33202 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.5342589 |
| Sum | 33202 |
| Variance | 0.53258805 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 78966 | |
| 1 | 11280 | 11.3% |
| 2 | 7995 | 8.0% |
| 3 | 1290 | 1.3% |
| 4 | 366 | 0.4% |
| 5 | 46 | < 0.1% |
| 6 | 37 | < 0.1% |
| 7 | 16 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 78966 | |
| 1 | 11280 | 11.3% |
| 2 | 7995 | 8.0% |
| 3 | 1290 | 1.3% |
| 4 | 366 | 0.4% |
| 5 | 46 | < 0.1% |
| 6 | 37 | < 0.1% |
| 7 | 16 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 16 | < 0.1% |
| 6 | 37 | < 0.1% |
| 5 | 46 | < 0.1% |
| 4 | 366 | 0.4% |
| 3 | 1290 | 1.3% |
| 2 | 7995 | 8.0% |
| 1 | 11280 | 11.3% |
| 0 | 78966 |
srch_rm_cnt
Real number (ℝ)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.11061 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 878.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.4533183 |
|---|---|
| Coefficient of variation (CV) | 0.40817056 |
| Kurtosis | 70.710453 |
| Mean | 1.11061 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.9115043 |
| Sum | 111061 |
| Variance | 0.20549748 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 91814 | |
| 2 | 6521 | 6.5% |
| 3 | 1092 | 1.1% |
| 4 | 274 | 0.3% |
| 5 | 137 | 0.1% |
| 8 | 78 | 0.1% |
| 6 | 50 | 0.1% |
| 7 | 32 | < 0.1% |
| 0 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1 | 91814 | |
| 2 | 6521 | 6.5% |
| 3 | 1092 | 1.1% |
| 4 | 274 | 0.3% |
| 5 | 137 | 0.1% |
| 6 | 50 | 0.1% |
| 7 | 32 | < 0.1% |
| 8 | 78 | 0.1% |
| Value | Count | Frequency (%) |
| 8 | 78 | 0.1% |
| 7 | 32 | < 0.1% |
| 6 | 50 | 0.1% |
| 5 | 137 | 0.1% |
| 4 | 274 | 0.3% |
| 3 | 1092 | 1.1% |
| 2 | 6521 | 6.5% |
| 1 | 91814 | |
| 0 | 2 | < 0.1% |
srch_destination_id
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8878 |
|---|---|
| Distinct (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14444.046 |
| Minimum | 4 |
|---|---|
| Maximum | 65035 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 976.6 KiB |
Quantile statistics
| Minimum | 4 |
|---|---|
| 5-th percentile | 1691 |
| Q1 | 8267 |
| median | 9147 |
| Q3 | 18788 |
| 95-th percentile | 42630 |
| Maximum | 65035 |
| Range | 65031 |
| Interquartile range (IQR) | 10521 |
Descriptive statistics
| Standard deviation | 11068.115 |
|---|---|
| Coefficient of variation (CV) | 0.76627523 |
| Kurtosis | 3.8498045 |
| Mean | 14444.046 |
| Median Absolute Deviation (MAD) | 2861 |
| Skewness | 1.9022011 |
| Sum | 1.4444046 × 109 |
| Variance | 1.2250317 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8250 | 3524 | 3.5% |
| 8267 | 2691 | 2.7% |
| 8791 | 1665 | 1.7% |
| 8268 | 1393 | 1.4% |
| 8253 | 1341 | 1.3% |
| 8745 | 1314 | 1.3% |
| 8279 | 1162 | 1.2% |
| 11439 | 991 | 1.0% |
| 8260 | 902 | 0.9% |
| 12206 | 895 | 0.9% |
| Other values (8868) | 84122 |
| Value | Count | Frequency (%) |
| 4 | 3 | < 0.1% |
| 8 | 9 | < 0.1% |
| 9 | 1 | < 0.1% |
| 11 | 3 | < 0.1% |
| 14 | 3 | < 0.1% |
| 16 | 2 | < 0.1% |
| 19 | 3 | < 0.1% |
| 21 | 28 | |
| 23 | 1 | < 0.1% |
| 24 | 14 |
| Value | Count | Frequency (%) |
| 65035 | 1 | |
| 65019 | 1 | |
| 64988 | 1 | |
| 64986 | 1 | |
| 64982 | 1 | |
| 64940 | 1 | |
| 64926 | 1 | |
| 64925 | 1 | |
| 64884 | 1 | |
| 64871 | 2 |
srch_destination_type_id
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.57972 |
| Minimum | 1 |
|---|---|
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 878.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.1502444 |
|---|---|
| Coefficient of variation (CV) | 0.83351852 |
| Kurtosis | -1.1650403 |
| Mean | 2.57972 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.78875045 |
| Sum | 257972 |
| Variance | 4.623551 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 61883 | |
| 6 | 22413 | 22.4% |
| 3 | 7308 | 7.3% |
| 5 | 4754 | 4.8% |
| 4 | 3305 | 3.3% |
| 8 | 328 | 0.3% |
| 9 | 5 | < 0.1% |
| 7 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 61883 | |
| 3 | 7308 | 7.3% |
| 4 | 3305 | 3.3% |
| 5 | 4754 | 4.8% |
| 6 | 22413 | 22.4% |
| 7 | 4 | < 0.1% |
| 8 | 328 | 0.3% |
| 9 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 9 | 5 | < 0.1% |
| 8 | 328 | 0.3% |
| 7 | 4 | < 0.1% |
| 6 | 22413 | 22.4% |
| 5 | 4754 | 4.8% |
| 4 | 3305 | 3.3% |
| 3 | 7308 | 7.3% |
| 1 | 61883 |
is_booking
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 878.9 KiB |
| False | |
|---|---|
| True | 7789 |
| Value | Count | Frequency (%) |
| False | 92211 | |
| True | 7789 | 7.8% |
cnt
Real number (ℝ)
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.48251 |
| Minimum | 1 |
|---|---|
| Maximum | 44 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 976.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 3 |
| Maximum | 44 |
| Range | 43 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.2028751 |
|---|---|
| Coefficient of variation (CV) | 0.81137741 |
| Kurtosis | 76.06216 |
| Mean | 1.48251 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.9854761 |
| Sum | 148251 |
| Variance | 1.4469086 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 74398 | |
| 2 | 15260 | 15.3% |
| 3 | 5452 | 5.5% |
| 4 | 2193 | 2.2% |
| 5 | 1124 | 1.1% |
| 6 | 614 | 0.6% |
| 7 | 343 | 0.3% |
| 8 | 211 | 0.2% |
| 9 | 125 | 0.1% |
| 10 | 81 | 0.1% |
| Other values (22) | 199 | 0.2% |
| Value | Count | Frequency (%) |
| 1 | 74398 | |
| 2 | 15260 | 15.3% |
| 3 | 5452 | 5.5% |
| 4 | 2193 | 2.2% |
| 5 | 1124 | 1.1% |
| 6 | 614 | 0.6% |
| 7 | 343 | 0.3% |
| 8 | 211 | 0.2% |
| 9 | 125 | 0.1% |
| 10 | 81 | 0.1% |
| Value | Count | Frequency (%) |
| 44 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 28 | 2 | |
| 26 | 2 | |
| 25 | 1 | < 0.1% |
| 24 | 3 | |
| 23 | 3 |
hotel_continent
Real number (ℝ)
ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.15373 |
| Minimum | 0 |
|---|---|
| Maximum | 6 |
| Zeros | 1836 |
| Zeros (%) | 1.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 878.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 2 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.6196059 |
|---|---|
| Coefficient of variation (CV) | 0.5135525 |
| Kurtosis | -0.66575015 |
| Mean | 3.15373 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.79172655 |
| Sum | 315373 |
| Variance | 2.6231233 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 52594 | |
| 6 | 20023 | 20.0% |
| 3 | 13112 | 13.1% |
| 4 | 11464 | 11.5% |
| 0 | 1836 | 1.8% |
| 5 | 971 | 1.0% |
| Value | Count | Frequency (%) |
| 0 | 1836 | 1.8% |
| 2 | 52594 | |
| 3 | 13112 | 13.1% |
| 4 | 11464 | 11.5% |
| 5 | 971 | 1.0% |
| 6 | 20023 | 20.0% |
| Value | Count | Frequency (%) |
| 6 | 20023 | 20.0% |
| 5 | 971 | 1.0% |
| 4 | 11464 | 11.5% |
| 3 | 13112 | 13.1% |
| 2 | 52594 | |
| 0 | 1836 | 1.8% |
hotel_country
Real number (ℝ)
| Distinct | 176 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81.39069 |
| Minimum | 0 |
|---|---|
| Maximum | 212 |
| Zeros | 151 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 878.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 8 |
| Q1 | 50 |
| median | 50 |
| Q3 | 106 |
| 95-th percentile | 198 |
| Maximum | 212 |
| Range | 212 |
| Interquartile range (IQR) | 56 |
Descriptive statistics
| Standard deviation | 56.144171 |
|---|---|
| Coefficient of variation (CV) | 0.68981072 |
| Kurtosis | -0.18801869 |
| Mean | 81.39069 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | 1.0231865 |
| Sum | 8139069 |
| Variance | 3152.1679 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 50 | 47915 | |
| 8 | 5039 | 5.0% |
| 198 | 4679 | 4.7% |
| 105 | 3579 | 3.6% |
| 70 | 3204 | 3.2% |
| 204 | 2810 | 2.8% |
| 182 | 2444 | 2.4% |
| 77 | 2433 | 2.4% |
| 106 | 1765 | 1.8% |
| 144 | 1584 | 1.6% |
| Other values (166) | 24548 |
| Value | Count | Frequency (%) |
| 0 | 151 | 0.2% |
| 1 | 47 | < 0.1% |
| 2 | 13 | < 0.1% |
| 3 | 13 | < 0.1% |
| 4 | 33 | < 0.1% |
| 5 | 851 | 0.9% |
| 7 | 268 | 0.3% |
| 8 | 5039 | |
| 9 | 25 | < 0.1% |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 212 | 16 | < 0.1% |
| 211 | 7 | < 0.1% |
| 210 | 2 | < 0.1% |
| 209 | 1 | < 0.1% |
| 208 | 712 | 0.7% |
| 206 | 42 | < 0.1% |
| 205 | 1 | < 0.1% |
| 204 | 2810 | |
| 203 | 186 | 0.2% |
| 202 | 71 | 0.1% |
hotel_market
Real number (ℝ)
| Distinct | 1867 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 599.93135 |
| Minimum | 0 |
|---|---|
| Maximum | 2117 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 976.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 19 |
| Q1 | 159 |
| median | 592 |
| Q3 | 701 |
| 95-th percentile | 1614 |
| Maximum | 2117 |
| Range | 2117 |
| Interquartile range (IQR) | 542 |
Descriptive statistics
| Standard deviation | 511.41198 |
|---|---|
| Coefficient of variation (CV) | 0.85245083 |
| Kurtosis | 0.16599225 |
| Mean | 599.93135 |
| Median Absolute Deviation (MAD) | 370 |
| Skewness | 0.95903326 |
| Sum | 59993135 |
| Variance | 261542.21 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 628 | 4691 | 4.7% |
| 675 | 4288 | 4.3% |
| 682 | 2288 | 2.3% |
| 19 | 2128 | 2.1% |
| 365 | 2010 | 2.0% |
| 701 | 2005 | 2.0% |
| 110 | 1993 | 2.0% |
| 27 | 1723 | 1.7% |
| 1230 | 1647 | 1.6% |
| 212 | 1376 | 1.4% |
| Other values (1857) | 75851 |
| Value | Count | Frequency (%) |
| 0 | 2 | < 0.1% |
| 1 | 4 | < 0.1% |
| 2 | 859 | |
| 3 | 16 | < 0.1% |
| 4 | 436 | |
| 5 | 211 | 0.2% |
| 6 | 117 | 0.1% |
| 7 | 89 | 0.1% |
| 8 | 244 | 0.2% |
| 9 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 2117 | 10 | < 0.1% |
| 2116 | 1 | < 0.1% |
| 2114 | 1 | < 0.1% |
| 2113 | 3 | < 0.1% |
| 2112 | 2 | < 0.1% |
| 2111 | 30 | |
| 2109 | 2 | < 0.1% |
| 2108 | 5 | < 0.1% |
| 2107 | 9 | < 0.1% |
| 2106 | 2 | < 0.1% |
hotel_cluster
Real number (ℝ)
| Distinct | 100 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.82681 |
| Minimum | 0 |
|---|---|
| Maximum | 99 |
| Zeros | 969 |
| Zeros (%) | 1.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 878.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 25 |
| median | 49 |
| Q3 | 73 |
| 95-th percentile | 96 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 48 |
Descriptive statistics
| Standard deviation | 28.923766 |
|---|---|
| Coefficient of variation (CV) | 0.580486 |
| Kurtosis | -1.1503802 |
| Mean | 49.82681 |
| Median Absolute Deviation (MAD) | 24 |
| Skewness | 0.0062483961 |
| Sum | 4982681 |
| Variance | 836.58422 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 91 | 2701 | 2.7% |
| 41 | 2101 | 2.1% |
| 48 | 1981 | 2.0% |
| 64 | 1817 | 1.8% |
| 65 | 1749 | 1.7% |
| 5 | 1648 | 1.6% |
| 98 | 1521 | 1.5% |
| 59 | 1484 | 1.5% |
| 21 | 1455 | 1.5% |
| 83 | 1451 | 1.5% |
| Other values (90) | 82092 |
| Value | Count | Frequency (%) |
| 0 | 969 | |
| 1 | 1193 | |
| 2 | 1146 | |
| 3 | 573 | 0.6% |
| 4 | 897 | |
| 5 | 1648 | |
| 6 | 1013 | |
| 7 | 695 | |
| 8 | 889 | |
| 9 | 1300 |
| Value | Count | Frequency (%) |
| 99 | 1237 | |
| 98 | 1521 | |
| 97 | 1281 | |
| 96 | 1020 | 1.0% |
| 95 | 1374 | |
| 94 | 818 | 0.8% |
| 93 | 568 | 0.6% |
| 92 | 646 | 0.6% |
| 91 | 2701 | |
| 90 | 1107 |
| channel | cnt | hotel_cluster | hotel_continent | hotel_country | hotel_market | is_booking | is_mobile | is_package | orig_destination_distance | posa_continent | site_name | srch_adults_cnt | srch_children_cnt | srch_destination_id | srch_destination_type_id | srch_rm_cnt | user_id | user_location_city | user_location_country | user_location_region | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| channel | 1.000 | -0.014 | 0.008 | -0.031 | -0.018 | 0.017 | 0.040 | 0.101 | 0.094 | 0.001 | 0.156 | -0.062 | -0.037 | 0.013 | -0.004 | 0.018 | 0.004 | 0.002 | 0.027 | 0.080 | 0.014 |
| cnt | -0.014 | 1.000 | 0.002 | 0.021 | -0.006 | -0.012 | 0.036 | 0.007 | 0.095 | 0.035 | 0.005 | 0.017 | 0.033 | 0.036 | -0.013 | -0.011 | 0.006 | 0.001 | 0.000 | -0.000 | -0.005 |
| hotel_cluster | 0.008 | 0.002 | 1.000 | 0.001 | -0.035 | 0.036 | 0.041 | 0.010 | 0.134 | 0.024 | 0.054 | -0.027 | 0.022 | 0.016 | -0.016 | -0.037 | -0.006 | 0.004 | 0.000 | -0.006 | 0.013 |
| hotel_continent | -0.031 | 0.021 | 0.001 | 1.000 | 0.353 | -0.303 | 0.058 | 0.048 | 0.299 | 0.492 | 0.372 | 0.228 | -0.017 | -0.051 | 0.024 | -0.069 | 0.028 | 0.006 | 0.002 | -0.050 | -0.033 |
| hotel_country | -0.018 | -0.006 | -0.035 | 0.353 | 1.000 | -0.104 | 0.046 | 0.038 | 0.234 | 0.189 | 0.235 | 0.306 | -0.036 | -0.039 | 0.068 | -0.023 | 0.010 | 0.012 | -0.011 | 0.044 | -0.081 |
| hotel_market | 0.017 | -0.012 | 0.036 | -0.303 | -0.104 | 1.000 | 0.049 | 0.028 | 0.192 | -0.193 | 0.174 | -0.124 | 0.018 | 0.012 | 0.103 | 0.047 | -0.008 | -0.007 | 0.009 | 0.039 | 0.076 |
| is_booking | 0.040 | 0.036 | 0.041 | 0.058 | 0.046 | 0.049 | 1.000 | 0.032 | 0.075 | -0.057 | 0.023 | -0.012 | -0.070 | -0.019 | 0.023 | 0.043 | 0.008 | 0.003 | -0.001 | 0.006 | 0.006 |
| is_mobile | 0.101 | 0.007 | 0.010 | 0.048 | 0.038 | 0.028 | 0.032 | 1.000 | 0.053 | -0.059 | 0.039 | -0.017 | 0.033 | 0.017 | 0.001 | -0.021 | -0.020 | -0.005 | 0.002 | 0.017 | 0.018 |
| is_package | 0.094 | 0.095 | 0.134 | 0.299 | 0.234 | 0.192 | 0.075 | 0.053 | 1.000 | 0.190 | 0.122 | 0.046 | 0.004 | -0.039 | -0.149 | -0.234 | -0.028 | -0.008 | 0.014 | 0.004 | 0.041 |
| orig_destination_distance | 0.001 | 0.035 | 0.024 | 0.492 | 0.189 | -0.193 | -0.057 | -0.059 | 0.190 | 1.000 | 0.203 | 0.084 | -0.010 | -0.047 | -0.081 | -0.078 | -0.010 | 0.010 | 0.023 | 0.100 | 0.040 |
| posa_continent | 0.156 | 0.005 | 0.054 | 0.372 | 0.235 | 0.174 | 0.023 | 0.039 | 0.122 | 0.203 | 1.000 | -0.638 | 0.033 | 0.034 | 0.024 | 0.049 | -0.031 | -0.011 | 0.066 | 0.215 | 0.160 |
| site_name | -0.062 | 0.017 | -0.027 | 0.228 | 0.306 | -0.124 | -0.012 | -0.017 | 0.046 | 0.084 | -0.638 | 1.000 | -0.013 | -0.016 | 0.007 | -0.011 | 0.009 | 0.026 | -0.020 | 0.235 | -0.015 |
| srch_adults_cnt | -0.037 | 0.033 | 0.022 | -0.017 | -0.036 | 0.018 | -0.070 | 0.033 | 0.004 | -0.010 | 0.033 | -0.013 | 1.000 | 0.119 | 0.007 | -0.025 | 0.426 | -0.005 | 0.005 | 0.047 | 0.022 |
| srch_children_cnt | 0.013 | 0.036 | 0.016 | -0.051 | -0.039 | 0.012 | -0.019 | 0.017 | -0.039 | -0.047 | 0.034 | -0.016 | 0.119 | 1.000 | 0.010 | -0.005 | 0.090 | -0.004 | 0.008 | 0.026 | 0.013 |
| srch_destination_id | -0.004 | -0.013 | -0.016 | 0.024 | 0.068 | 0.103 | 0.023 | 0.001 | -0.149 | -0.081 | 0.024 | 0.007 | 0.007 | 0.010 | 1.000 | 0.518 | 0.008 | 0.003 | 0.005 | 0.012 | 0.021 |
| srch_destination_type_id | 0.018 | -0.011 | -0.037 | -0.069 | -0.023 | 0.047 | 0.043 | -0.021 | -0.234 | -0.078 | 0.049 | -0.011 | -0.025 | -0.005 | 0.518 | 1.000 | 0.007 | 0.011 | -0.001 | 0.025 | 0.007 |
| srch_rm_cnt | 0.004 | 0.006 | -0.006 | 0.028 | 0.010 | -0.008 | 0.008 | -0.020 | -0.028 | -0.010 | -0.031 | 0.009 | 0.426 | 0.090 | 0.008 | 0.007 | 1.000 | 0.004 | -0.002 | -0.003 | -0.010 |
| user_id | 0.002 | 0.001 | 0.004 | 0.006 | 0.012 | -0.007 | 0.003 | -0.005 | -0.008 | 0.010 | -0.011 | 0.026 | -0.005 | -0.004 | 0.003 | 0.011 | 0.004 | 1.000 | -0.005 | -0.018 | -0.010 |
| user_location_city | 0.027 | 0.000 | 0.000 | 0.002 | -0.011 | 0.009 | -0.001 | 0.002 | 0.014 | 0.023 | 0.066 | -0.020 | 0.005 | 0.008 | 0.005 | -0.001 | -0.002 | -0.005 | 1.000 | 0.121 | 0.136 |
| user_location_country | 0.080 | -0.000 | -0.006 | -0.050 | 0.044 | 0.039 | 0.006 | 0.017 | 0.004 | 0.100 | 0.215 | 0.235 | 0.047 | 0.026 | 0.012 | 0.025 | -0.003 | -0.018 | 0.121 | 1.000 | 0.198 |
| user_location_region | 0.014 | -0.005 | 0.013 | -0.033 | -0.081 | 0.076 | 0.006 | 0.018 | 0.041 | 0.040 | 0.160 | -0.015 | 0.022 | 0.013 | 0.021 | 0.007 | -0.010 | -0.010 | 0.136 | 0.198 | 1.000 |
| date_time | site_name | posa_continent | user_location_country | user_location_region | user_location_city | orig_destination_distance | user_id | is_mobile | is_package | channel | srch_ci | srch_co | srch_adults_cnt | srch_children_cnt | srch_rm_cnt | srch_destination_id | srch_destination_type_id | is_booking | cnt | hotel_continent | hotel_country | hotel_market | hotel_cluster | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 11837970 | 2014-08-01 06:09:40 | 2 | 3 | 66 | 196 | 2428 | 206.772995 | 1063336 | False | False | 1 | 2014-11-14 | 2014-11-15 | 2 | 0 | 1 | 7635 | 3 | False | 2 | 2 | 50 | 675 | 56 |
| 19143100 | 2014-06-16 08:28:20 | 2 | 3 | 66 | 348 | 21291 | 4367.375000 | 229874 | False | False | 0 | 2014-08-26 | 2014-08-28 | 3 | 0 | 1 | 17823 | 1 | False | 1 | 6 | 105 | 770 | 36 |
| 739874 | 2013-09-15 14:51:26 | 2 | 3 | 66 | 318 | 28994 | 1971.327637 | 71596 | False | True | 2 | 2013-12-24 | 2013-12-31 | 2 | 2 | 1 | 8254 | 1 | True | 1 | 2 | 50 | 365 | 37 |
| 3827636 | 2014-12-08 19:52:08 | 2 | 3 | 66 | 174 | 18877 | 334.765198 | 290929 | False | False | 9 | 2014-12-12 | 2014-12-13 | 2 | 0 | 1 | 23541 | 6 | False | 1 | 2 | 50 | 660 | 9 |
| 36318421 | 2014-12-15 22:29:27 | 24 | 2 | 3 | 64 | 12805 | NaN | 1075410 | False | False | 4 | 2014-12-27 | 2014-12-29 | 2 | 0 | 1 | 20332 | 1 | True | 1 | 3 | 126 | 264 | 14 |
| 3560318 | 2014-05-12 14:27:12 | 2 | 3 | 66 | 174 | 37675 | 2587.924072 | 838867 | False | False | 1 | 2014-07-13 | 2014-07-19 | 1 | 0 | 1 | 12184 | 6 | False | 1 | 2 | 50 | 690 | 91 |
| 2787821 | 2014-11-16 23:14:16 | 2 | 3 | 66 | 142 | 17440 | 2336.184082 | 1187727 | False | False | 2 | 2014-11-19 | 2014-11-21 | 2 | 0 | 1 | 13237 | 4 | False | 3 | 2 | 50 | 365 | 4 |
| 19701068 | 2014-03-06 07:43:32 | 2 | 3 | 66 | 348 | 53377 | 2428.343262 | 555746 | False | True | 9 | 2014-12-18 | 2014-12-21 | 4 | 0 | 1 | 8824 | 1 | False | 1 | 4 | 8 | 118 | 52 |
| 937308 | 2014-10-29 20:45:57 | 26 | 0 | 215 | 646 | 47902 | NaN | 689139 | False | False | 2 | 2014-12-10 | 2014-12-11 | 2 | 0 | 1 | 13925 | 6 | False | 1 | 4 | 8 | 126 | 44 |
| 4257740 | 2013-06-21 18:38:17 | 24 | 2 | 3 | 50 | 5703 | NaN | 212812 | False | False | 1 | 2013-09-04 | 2013-09-09 | 1 | 0 | 1 | 8799 | 1 | False | 1 | 6 | 204 | 1463 | 35 |
| date_time | site_name | posa_continent | user_location_country | user_location_region | user_location_city | orig_destination_distance | user_id | is_mobile | is_package | channel | srch_ci | srch_co | srch_adults_cnt | srch_children_cnt | srch_rm_cnt | srch_destination_id | srch_destination_type_id | is_booking | cnt | hotel_continent | hotel_country | hotel_market | hotel_cluster | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 14663091 | 2013-02-17 23:44:53 | 2 | 3 | 12 | 752 | 46545 | NaN | 500738 | False | False | 2 | 2013-05-31 | 2013-06-10 | 2 | 1 | 1 | 1725 | 1 | False | 4 | 3 | 182 | 1493 | 57 |
| 26577419 | 2014-07-17 17:06:44 | 2 | 3 | 66 | 220 | 19222 | 205.578903 | 1010517 | False | False | 9 | 2014-07-27 | 2014-08-01 | 1 | 3 | 1 | 8260 | 1 | False | 1 | 2 | 50 | 701 | 5 |
| 2907716 | 2014-10-31 14:45:44 | 26 | 0 | 215 | 646 | 51733 | 1450.634399 | 307336 | False | False | 5 | 2014-12-27 | 2015-01-01 | 2 | 4 | 1 | 11373 | 1 | False | 1 | 4 | 128 | 1455 | 96 |
| 23172655 | 2014-07-25 07:32:52 | 2 | 3 | 66 | 321 | 3263 | 139.231995 | 78779 | False | False | 9 | 2014-08-13 | 2014-08-14 | 1 | 0 | 1 | 411 | 1 | False | 1 | 2 | 50 | 470 | 22 |
| 35485790 | 2014-09-17 19:23:52 | 2 | 3 | 66 | 348 | 47997 | 1585.335205 | 301885 | True | True | 9 | 2014-09-23 | 2014-09-26 | 4 | 0 | 2 | 5405 | 6 | False | 4 | 4 | 8 | 126 | 52 |
| 13757138 | 2013-05-30 21:50:22 | 37 | 1 | 69 | 866 | 54284 | NaN | 240138 | False | False | 9 | 2013-12-01 | 2013-12-11 | 2 | 1 | 1 | 8268 | 1 | False | 1 | 2 | 50 | 682 | 48 |
| 19318762 | 2014-07-22 11:43:54 | 2 | 3 | 66 | 448 | 38023 | 7920.593262 | 710825 | False | False | 9 | 2014-11-20 | 2014-11-21 | 1 | 0 | 1 | 407 | 3 | False | 1 | 5 | 122 | 1462 | 57 |
| 969565 | 2014-06-11 07:57:17 | 2 | 3 | 66 | 448 | 14052 | 365.342804 | 802904 | False | False | 9 | 2014-06-12 | 2014-06-13 | 2 | 0 | 1 | 17456 | 1 | False | 1 | 2 | 50 | 437 | 48 |
| 27324555 | 2014-08-18 16:21:24 | 24 | 2 | 3 | 64 | 12576 | NaN | 774191 | False | False | 2 | 2014-08-20 | 2014-08-23 | 1 | 0 | 1 | 18774 | 1 | False | 1 | 3 | 99 | 1239 | 3 |
| 555923 | 2013-08-04 09:56:17 | 2 | 3 | 66 | 340 | 32193 | 5988.224609 | 685138 | False | False | 9 | 2013-08-29 | 2013-09-02 | 3 | 0 | 1 | 12333 | 5 | True | 1 | 6 | 20 | 2104 | 29 |